Nucleic Acid Molecule, comprising a Nucleic Acid Coding for a Polypeptide 
with Chorismate Mutase Activity 

The present invention concerns a nucleic acid molecule, comprising a nucleic acid coding for 
a polypeptide with chorismate mutase activity and derivatives thereof, wherein the 
derivatives show at least 10% of the chorismate mutase activity of the chorismate mutase 
according to SEQ ID NO: 2. The invention further concerns vectors containing nucleic acid 
molecules, host cells containing nucleic acid molecules and processes for the production of 
polypeptides with chorismate mutase activity. The present invention further concerns the 
polypeptides with chorismate mutase activity and antibodies which specifically recognise 
these. In addition, the invention concerns processes for the production of auxotrophic yeast 
strains by means of the nucleic acid molecules and the use thereof in processes for the 
recombinant expression of heterologous genes. 

As single-cell, eukaryotic microorganisms, yeasts can be cultured without difficulty and have 
the great advantage of being readily genetically manipulate. Also, they can process and 
modify recombinant proteins in accordance with the patterns known from higher organisms. 
As far as is yet known, they contain no pathogenic substances and are therefore also suitable 
for the production of therapeutic proteins. Thus for example the first vaccine produced by 
heterologous gene expression, the hepatitis B vaccine, was heterologously expressed in the 
well-characterised baker's yeast Saccharomyces cerevisiae (Lepetic et al., 1996). 

While S. cerevisiae makes it possible to produce many different proteins (for a review, see 
Gellissen and Hollenberg, 1997), there are nonetheless also a few limiting properties. Thus 
for example the maximal content of heterologous protein is about 1-5% of the total protein 
content of the cell (Buckholz and Gleeson, 1991). 

Various other yeasts, such as for example Schizosaccharomyces pombe, Kluyveromyces 
lactis, Yarrowia lipolytica and also Hansenula polymorpha have been characterised and 
compared with S. cerevisiae in terms of their suitability as expression systems (Muller et al., 
1998). All these yeasts showed markedly stronger secretion of active protein than S. 
cerevisiae, the heterologous gene expression being dependent on the particular gene, but 
independent of the donor organism. Basically, methylotrophic yeasts are found to be 
attractive organisms for the production of recombinant proteins. The methylotrophic yeasts 
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subdivided into the four genera Hansenula, Pichia, Candida and Torulopsis, which all 
possess the ability to utilise methane, methylamine, formaldehyde or formate as carbon and 
energy source. 

Inter alia, the yeast Hansenula polymorpha from the Saccharomycetaceae family (Lodder, 
1970) belongs to the relatively small group of the methylotrophic yeasts. Hansenula 
polymorpha does not ferment glucose under aerobic conditions, and is thus Crabtree-negative 
(Verduyn et al., 1992) and with a growth temperature optimum of 37°C is classed among the 
thermotolerant yeasts; thus it is an exception among the methylotrophic genera. The natural 
habitat of the methylotrophic yeasts is locations rich in organic material. 

Hitherto, only a few genes of the yeast Hansenula polymorpha had been cloned and charact- 
erised (Hansen and Hollenberg, 1996), and according to GenBank the number of cloned 
genes in Hansenula polymorpha amounts to > 30. However, how many of these are 
thoroughly characterised or are suitable as marker genes for plasmid selection, is unknown to 
me. Hence, I would suggest that similarities of these genes with the homologous genes in 
Saccharomyces cerevisiae, insofar as these are present, are relatively limited (Dobson et al., 
1982). The key enzymes in methylotrophic metabolism are the enzymes methanol oxidase 
(MOX), dihydroxyacetone synthase (DAS) and formate dehydrogenase (FMD), whose 
controlled expression by very strongly regulated promoters opens up a variety of useful 
possibilities for heterologous gene expression. These facts make Hansenula polymorpha an 
industrially interesting organism (Gellissen et al., 1994). 

So far, only two auxotrophic strains of H. polymorpha are available, whose transformation 
can be selected by functional complementation of ura3- or leu2- deficiency respectively. The 
provision of further transformation systems, consisting of auxotrophic strains and of nucleic 
acids capable of complementing the auxotrophy hitherto failed because suitable genes 
capable of this complementation were not available. While a number of genes from the 
amino acid or nucleic acid biosynthesis pathway of the baker's yeast S. cerevisiae were 
known, it had nonetheless in the past been found that the differences between the genes of 
baker's yeast and the methylotrophic yeasts are sometimes considerable. Hence it cannot in 
general be expected that a S. cerevisiae gene will be suitable for complementation of an 
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auxotrophy in a methylotrophic yeast, nor can it be expected that genes from a methylo- 
trophic yeast will be capable of complementing an auxotrophy in S. cerevisiae. 

Hence a technical problem underlying the present invention is to provide a new gene from H. 
polymorph, which can serve as a selectable marker for complementation in the 
transformation of suitable auxotrophic yeast strains. A further problem consists in the 
provision of vectors and host cells, which contain the gene. Further, it is to provide the 
polypeptide encoded by the gene and antibodies which specifically recognise the polypeptide. 
Finally, it is to provide suitable auxotrophic strains. 

This problem is solved according to the invention by a nucleic acid molecule comprising a 
nucleic acid coding for a polypeptide with chorismate mutase activity or the complementary 
strand thereof, wherein the nucleic acid is selected from 



M (a) a nucleic acid with the DNA sequence stated in SEQ ED NO: I or the RN A sequence 

corresponding thereto; 



(b) a nucleic acid which hybridises with the complementary strand of a nucleic acid 
according to (a); 

(c) a nucleic acid which on the basis of the genetic code is degenerate to the DNA sequences 
defined under (a) and (b); 

(d) a nucleic acid which hybridises with one of the nucleic acids stated in (a) to (c) and the 
complementary strand thereof codes for a polypeptide with chorismate mutase activity; 

(e) a nucleic acid which is at least 60% homologous to one of the nucleic acids stated in (a) 
to(d); 

(f) a variant of the nucleic acids stated in (a) to (e), wherein the variant had additions, 
deletions, insertions or inversions relative to the nucleic acids stated in (a) to (e); 



(g) a fragment of one of the nucleic acids stated in (a) to (f); 
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(h) a combination of several of the nucleic acids stated in (a) to (g), 

wherein the polypeptide encoded by the nucleic acid or complementary strand thereof has at 
least 10% of the chorismate mutase activity of the chorismate mutase according to SEQ ID 

NO:2. 

In microorganisms and plants, the biosynthesis of aromatic amino acids proceeds firstly 
through 7 enzyme-catalysed reactions of the shikimate pathway from erythrose-4-phosphate 
and phosphoenol pyruvate to chorismate (figure 1). Chorismate is the substrate of the first 
branching point. In the baker's yeast Saccharomyces cerevisiae, starting from this branching 
point, on the one hand anthranilate is formed via the enzyme anthranilate synthase (EC. 
4.1.3.27), and on the other prephenate via the enzyme chorismate mutase (E.C. 5.4.99.5). 
Finally, via further intermediate products, tyrosine and phenylalanine are formed from 
prephenate (Braus, 1991). In the baker's yeast Saccharomyces cerevisiae, the chorismate 
mutase is encoded by the AR07 gene (Schmidheini et al, 1989), which is located on 
2 chromosome XVI. AR07 encodes a 0.95 kb mRNA and contains a 77 1 bp open reading 

lil frame, which codes for a protein consisting of 256 amino acids. 

Below, a number of terms are explained in more detail, in order to make clear how they 
should be understood in connection with the present application. 

The term "chorismate mutase", as used below in the description, includes complete 
chorismate mutase, chorismate mutase fragments, chorismate mutase mutants and fusion 
proteins thereof. Polypeptides which have at least 10% of the chorismate mutase activity of 
the chorismate mutase according to SEQ ID NO:2 are regarded as being according to the 
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invention. 



"Chorismate mutase activity" means the catalytic conversion of chorismate to prephenate as 
part of the phenylalanine and tyrosine biosynthesis which is catalysed by the enzyme chor- 
ismate mutase [E.C. 5.4.99.5]. The chorismate mutase activity of the polypeptide according 
to the invention can for example be measured spectrophotometrically through the acid- 
catalysed conversion of the product prephenate to phenylpyruvate, which absorbs at 320 nm 
(Schmidheini etal., 1989). 



For its growth or proliferation, a "prototrophic" microorganism needs only simple nutrients 
(carbon and nitrogen) and minerals, but can itself build up all needed amino acids. Hence it 
is capable of growing on "minimal medium". 

In contrast to this, "auxotrophic" microorganisms need additional factors, for example amino 
acids, which they cannot themselves synthesise owing to a defect in the biosynthesis pathway 
for the factor concerned. In connection with the present invention, the auxotrophy is 
preferably a phenylalanine/tyrosine auxotrophy, which is caused through diminished or 
absent chorismate mutase activity, e.g. of an auxotrophic yeast strain prepared according to 
the invention. 

"Minimal medium" means a nutrient solution, which contains only the components which are 
necessary for the growth of a prototrophic microorganism. In connection with the present 
invention, the minimal medium is preferably a phenylalanine- and/or tyrosine-free medium, 
through which the selection of the phenylalanine/tyrosine auxotrophic form of the micro- 
organism compared to the prototrophic form of the microorganism is made possible. 

"His" tag means a sequence of at least 6 histidine amino acids, which by appropriate cloning 
and fusion with an expressible sequence leads to a fusion protein with at least 6 His residues 
at the NH 2 terminus, which can easily be purified by complexing with a Ni 2+ column. 

A "heterologous gene" is understood to mean the coding region of a structural gene, which is 
either not expressed under the control of its own (homologous) promoter or not in the 
organism from which it is derived, or else is expressed neither under the control of its own 
promoter nor in the original organism. 

"Cloning" is intended to include all cloning methods known in the state of the technology, 
which could be used here, which are however not all described in detail, since they are among 
the obvious tools of the skilled person. 

"Recombinant expression in a suitable host cell" is to be understood to mean all expression 
methods known in the state of the technology in known expression systems, which could be 
used here, which are however not all described in detail, since they are among the obvious 
tools of the skilled person. 
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By selection and sequencing of a genomic clone from Hcmsertula polymorph* which after 
transformation is capable of functionally complementing the phenylalanine/tyrosine auxo- 
trophy of the Saccharomyces cerevisiae aro7A-deletion strain, the inventors have for the first 
time succeeded in identifying a nucleic acid according to the invention, and producing 
auxotrophic mutants of methylotrophic yeasts with this nucleic acid. Thus a further urgently 
needed transformation system for methylotrophic yeasts is provided, which makes the 
targeted selection of transformed yeasts with simple media possible. 

The inventors have prepared a genomic bank from H. polymorpha by cloning of restriction 
fragments of chromosomal H. polymorpha DNA into a shuttle vector, the said shuttle vector 
having a high copy number in yeast and the Hansermla genes being expressed in S. cerevisiae 
under the control of their endogenous promoter. This genomic bank was used for the 
complementation of auxotrophic Saccharomyces cerevisiae strains, although it was not 
known each endogenous promoter of Hansermla polymorpha is active in S. cerevisiae. 

Surprisingly, it was now found that the chorismate mutase gene from Hansermla polymorpha 
is transcribed and translated in S. cerevisiae and is enzymatically active. 

For several reasons, it was not to be expected that the chorismate mutase gene in particular 
could be expressed in Saccharomyces cerevisiae. 

Thus it was not known whether the chorismate mutase gene from H. polymorpha contains 
introns and whether the S. cerevisiae splicing apparatus is capable of the correct processing 
of the possible pre-RNA. From now on, it could be shown that the chorismate mutase gene 
from H. polymorpha has no introns. 

Further, it could not be foreseen that the translation product can be correctly folded and if 
necessary assembled in S. cerevisiae, in order to give a sufficiently active chorismate mutase 
which is capable of catalysing the reaction of chorismate to prephenate. The absence of 
predictability is supported by the fact that HLEU2, the LEU2 homologue from H. 
polymorpha, is not capable of complementing a corresponding mutation in S. cerevisiae 
(Agaphonov et al., 1994). 
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Further, it was not known that the biosynthesis of the aromatic amino acids tyrosine and 
phenylalanine in H. polymorpha involves the conversion of chorismate to prephenate with the 
aid of the enzyme chorismate mutase at all. In fact it is known that there are considerable 
differences in the biosynthesis of aromatic amino acids. Thus in some cyanobacteria 
prephenate is first converted to arogenate by transamination. This pathway is also used by 
plants (Jensen and Stenmark, 1975) and is more widespread in nature than the hydroxy- 
phenylpyruvate/phenylpyruvate pathway, which is used by Saccharomyces cerevisiae and 
E. coli. 

The nucleic acid contained in the nucleic acid molecule according to the invention can be 
genomic DNA, cDNA or synthetic DNA, where a synthetic DNA sequence is also 
understood to mean one such which contains modified internucleoside bonds. Furthermore 
the nucleic acid can be an RNA sequence, which e.g. can be necessary for expression by 
means of recombinant vector systems. The nucleic acid according to (b) is for example 
obtainable by use of a detectably labelled probe which corresponds to one of the sequences 
stated in (a) or a fragment or complementary strand thereof for the screening of cDNA or 
genomic DNA libraries from microorganisms. Preferably the microorganism from which the 
bank is created belongs to a genus selected from: Pichia, Hansenula, Candida, Torulopsis, 
Saccharomyces, Schizosaccharomyces, Kluyveromyces and Yarrowia. In particular, the 
microorganism belongs to a species selected from: Hansenula polymorpha, Saccharomyces 
cerevisiae, Schizosaccharomyces pombe, Kluyveromyces lactis and Yarrowia lipolytica. In 
this, the identification of positive cDNA or genomic DNA clones is performed according to 
standard procedures. See Maniatis et al., Molecular Cloning (1989), Cold Spring Harbor 
Laboratory Press. 

In a preferred embodiment, the hybridisation stated under (b) or (d) is performed under 
stringent conditions. Stringent hybridisation conditions are e.g. incubation at 65°C overnight 
in 7% SDS, 1% BSA ImM EDTA 250 mM sodium phosphate buffer (pH 7.2) and 
subsequent washing at 65°C with 2 x SSC; 0.1% SDS. 

In a preferred embodiment, nucleic acids are provided which are at least 60% homologous to 
the nucleic acid sequence stated in (a). Preferred are nucleic acids at least 80% homologous 



8 



to the nucleic acid sequence stated in (a). Especially preferred are nucleic acids at least 90% 
or 95% homologous to the nucleic acid sequence stated in (a). 

According to the invention, the expression "homology" means homology at the DNA level, 
which can be determined by known procedures, e.g. computer-aided sequence comparisons 
(Basic local alignment search tool, S. F. Altschul et aL, J. MoL Biol. 215 (1990), 403-410). 

The expression "homology", well-known to the skilled person, designates the degree of 
t relatedness between two or more nucleic acid molecules, which is determined by the agree- 

ment between the sequences. The "percentage homology" is obtained from the percentage of 
identical regions in two or more sequences taking account of gaps or other sequence features. 
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The homology of mutually related nucleic acid molecules can be determined by means of 
known procedures. As a rule, special computer programs with the algorithms taking account 
of the special requirements are used. 

Preferred procedures for the determination of homology firstly generate the greatest agree- 
ment between the sequences studied. Computer programs for determination of homology 
include, but are not limited to, the GCG program package, including GAP (Devereux, J., et 
aL, Nucleic Acids Research 12 (12): 387 (1984); Genetics Computer Group University of 
Wisconsin, Madison, (WI)); and BLASTP, BLASTN and FASTA (Altschul S., et aL, J- MoL 
BioL, 215: 403-410 (1990)). The BLASTX program can be obtained from the National 
Centre for Biotechnology Information (NCBI) and from other sources (BLAST Handbook, 
Altschul S., et aL, NCB NLM NIH Bethesda MD 20894; Altschul S., et aL, J. MoL BioL, 
215: 403-410 (1990)). The well-known Smith- Waterman algorithm can also be used for the 
determination of homologies. 

Preferred parameters for the nucleic acid sequence comparison include the following: 

Algorithm: Needleman and Wunsch, J. MoL BioL 48: 443-453 (1970) 

Comparison matrix: Matches = + 10 

Mismatches = 0 
Gap penalty 50 
Gap length penalty: 3 



The GAP program is also suitable for use with the above parameters. The above parameters 
are the default parameters for nucleic acid sequence comparisons. 



Further examples of algorithms, gap opening penalties, gap extension penalties and 
comparison matrices including those named in the program handbook, Wisconsin package, 
Version 9, September 1997, can be used. The choice will depend on the comparison to be 
performed and further on whether the comparison is performed between sequence pairs, 
when GAP or Best Fit are preferred, or between one sequence and a large sequence database, 
when FASTA or BLAST are preferred. 

An agreement of 60% determined with the aforesaid algorithm is described in the context of 
the present application as 60% homology. The same applies for higher degrees of homology. 

In a preferred embodiment, the nucleic acid according to the invention is a combination of 
several of the nucleic acids stated under (a) to (f), which can be obtained by the fusion and if 
necessary cloning known to the skilled person. These combinations can e.g. be of especial 
interest for the creation of immunogenic constructs. 

In a further preferred embodiment, the polypeptide encoded by the nucleic acid according to 
the invention has at least 50% of the chorismate mutase activity of the chorismate mutase 
SEQ ID NO:2. It is especially preferred that the polypeptide has at least 75% of the chorism- 
ate mutase activity according to SEQ ID NO:2. Here, as already stated, the chorismate 
mutase activity is measured after Schmidheini et al., 1989. 

In a further embodiment, the nucleic acid molecule comprises a promoter suitable for its 
expression, the nucleic acid being under the control of the promoter. The choice of the 
promoter depends on the expression system used for expression, in particular on the choice of 
host organism. Preferred according to the invention are both constitutive promoters such as 
PGK- and G3PDH- promoters and also inducible promoters such as GAL4, ADH2 and the 
Cu-metallothionein promoter (survey in Recombinant Gene Expression Protocols in Methods 
in Molecular Biology Vol.63, Ed. R S Tuan, Chapter IE; Schena, M. et al. (1991) Vectors for 
constitutive and inducible expression in yeast, Methods in Enzymol. 194, 389-398). Further, 
the use of mammalian glucocorticoid response elements (GRE) for increasing transcription is 
also taken into account by the invention (Schena et al., Science 241 (1988), 965-967). 
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Also preferred are the promoters of the methanol metabolism of methylotrophic yeasts, in 
particular Hansenula polymorphs The genes for the methanol metabolism enzymes in 
Hansenula polymorpha are among the most strongly expressed and regulated genes that have 
so far been described in yeasts (Van der Klei et aL, 1991). The corresponding proteins can 
comprise up to 30% of the total protein content of the cell (Janowicz et al., 1985; Ledeboer et 
aL, 1985). The expression of the methanol metabolism genes can on the one hand be induced 
by methanol, on the other hand however, the presence of glycerol also leads to derepression. 
Thus in this way the strong promoters of the methanol metabolism can also be used for 
heterologous gene expression under methanol-free culture conditions. 

Especially preferred promoters are the methanol oxidase MOX-promoter described in H. 
polymorpha, which at about 1.5 kb is unusually large and is among the strongest yeast 
promoters so far described. The presence of glucose leads to repression of the MOX- 
promoter, however the activity of this promoter can be increased more than 1000-fold by 
glycerol or methanol (Godecke et al., 1994). Further especially preferred is the dihydroxy- 
acetone synthase D/tS-promoter (Ledeboer et al., 1985) and the M)-promoter (European 
Patent 299108). 

In a further preferred embodiment, the nucleic acid molecule also comprises a signal peptide- 
coding nucleic acid sequence, which ensures the export of the expressed protein, where the 
signal peptide-coding nucleic acid sequence is preferably directly 5' bound to the hetero- 
logous gene to be expressed. For the secretion and modification of many eukaryotic proteins, 
it is necessary to fuse the protein sequence at the N-terminus with a signal sequence, in order 
to steer the polypeptides into the secretion apparatus. Possible for this are for example 
components from the S. occidental gene GAM1 (Dohmen et al., 1990) and from a hormone 
gene of the crab Carcinus maenas (Weydemann et aL, 1989), which were successfully used 
for the secretion of hirudin (Weydemann et al., 1995). 

In a further preferred embodiment, the nucleic acid molecule also comprises at least a part of 
a vector, in particular control regions, wherein the vector can be selected from: 
bacteriophages such as X-derivatives, plasmids, adenoviruses, vaccinia viruses, baculoviruses, 
SV40 viruses and retroviruses, preferably MoMuLV (Moloney Murine Leukemia Virus). 
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Especially preferred are yeast transformation vectors, both integrative yeast plasmids (Yip) 
and also extrachromosomal plasmid vectors being possibilities. The extrachromosomal 
plasmid vectors subdivide into episomal yeast plasmids (YEp), replicative yeast plasmids 
(YRp) and yeast centromer plasmids (YCp) (see Singh, K.K. and Heinemann, J. A., Chapter 
1 1 in Recombinant Gene Expression Protocols, v. supra). Further, artificial yeast chromo- 
somes (YACs) are also possible as expression vectors according to the invention. 

Also especially preferred vectors are yeast replication plasmids, which contain a replication 
origin ori and an antibiotic resistance cassette, so that they can be propagated and selected in 
E. coli. Furthermore, they bear an ARS sequence for chromosomally independent replication 
in the yeast cell, such as for example HARS1 from H. polymorpha, and a metabolic yeast 
selection marker, such as for example URA3 oxHLEU2 (Gellisen and Hollenberg, 1997). 

Many heterologous proteins have already been produced in H. polymorpha (Gellissen and 
Hollenberg, 1997) and by the placement of more than one expression cassette on one trans- 
formation vector, co-expression of different genes is also possible (Gilbert et al., 1994). 

Further, a nucleic acid molecule is preferred which in addition comprises a His-tag coding 
DNA sequence, which on expression of the construct leads to the formation of a fusion 
protein with a His-tag at the NH 2 terminus, which facilitates the purification of the protein on 
a nickel column through chelate formation. 

According to the invention, host cells are provided, which contain the nucleic acid molecule 
and which are suitable for the expression of the heterologous gene. In the state of the 
technology, many prokaryotic and eukaryotic expression systems are known, where the host 
cells are for example selected from prokaryotic cells such as E. coli or B. subtilis, from 
eukaryotic cells such as yeast cells, insect cells and mammalian cells, e.g. CHO cells, COS 
cells or HeLa cells, and derivatives thereof. In the state of the technology for example, 
certain CHO production lines are known whose glycosylation patterns are altered compared 
to CHO cells. 



Preferably, the yeast belongs to a genus selected from: Pichia, Hansemda, Candida, 
Torulopsis, Saccharomyces, Schizosaccharomyces, Kluyveromyces and Yarrowia. In 
particular, the microorganism belongs to a species selected from: Hansenula polymorpha, 
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Saccharomyces cerevisiae, Schizosaccharomyces pombe, Kluyveromyces lactis and Yarrowia 
lipolytica (see Recombinant Gene Expression Protocols in Methods in Molecular Biology, 
above). 

A further object of the present invention is a process for the production of a polypeptide with 
chorismate mutase activity. For this, the nucleic acid molecule according to the invention is 
expressed in a suitable host cell and the protein isolated from the host cell or the medium by 
standard procedures. 

Many procedures for the expression of DNA sequences are known to the skilled person; see 
Recombinant Gene Expression Protocols in Methods in Molecular Biology, above. The 
expression can be both constitutive and also inducible, inducers such as for example IPTG 
and Zn 2+ being known to the skilled person. The polypeptide with chorismate mutase activity 
produced can, if a His-tag was fused to the NH 2 terminus of the polypeptide, be purified by 
chelate formation on a nickel column. Preferably the polypeptide with chorismate mutase 
activity is purified by ion exchange chromatography and/or gel filtration chromatography. 
The implementation of these measures is well-known to the skilled person. 

In a further preferred embodiment, the polypeptide with chorismate mutase activity produced 
according to the invention is modified. Here the modifications include the di-, oligo- and 
polymerisation of the monomeric starting product for example through crosslinking, e.g. with 
dicyclohexylcarbodiimide or pegylation or association (self-assembly). Other modifications 
include side-chain modifications, for example of e-amino-lysine residues of the polypeptide, 
or amino- or carboxy-terminal modifications. Other modifications include posttranslational 
events, e.g. the glycosylation or the partial or complete deglycosylation of the protein. 

In a preferred embodiment, the polypeptide obtained in recombinant expression in 
prokaryotes or glycosylation-deficient eukaryotes is non-glycosylated. Also taken into 
account according to the invention is a polypeptide with chorismate mutase activity which is 
glycosylated by recombinant expression in eukaryotes capable of glycosylation such as yeast 
cells insect cells or mammalian cells, such as CHO cells or HeLa cells. 
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In a further embodiment, polypeptides with chorismate mutase activity are made available, 
which include an amino acid sequence, the amino acid sequence being encoded by one or 
several of the nucleic acid molecules according to the invention. 

Preferably, polypeptides with chorismate mutase activity are made available, which comprise 
the amino acid sequence in SEQ ID NO:2 or a fragment thereof, which has at least 10% of 
the chorismate mutase activity of the chorismate mutase according to SEQ ID NO:2, 
preferably more than 50% and especially preferably more than 75%. 

In a further embodiment, the invention provides polypeptides with chorismate mutase 
activity, obtainable by the recombinant production process or modifications thereof. 

Further, non-glycosylated and glycosylated polypeptide with chorismate mutase activity, 
obtainable by expression in host cells capable or incapable of glycosylation, is provided. 
Depending on the intended use of the polypeptide, the glycosylation pattern of yeast, in 
particular methylotrophic yeast such as Hansenula polymorpha, of COS or HeLa cells can be 
preferred. 

The invention further provides antibodies which specifically react with the polypeptide with 
chorismate mutase activity according to the invention and are obtainable by immunisation of 
an experimental animal with the polypeptide. Polyclonal antibodies can be obtained by 
immunisation of for example rabbits, mice or rats and subsequent extraction of antisera. 
Monoclonal antibodies can be obtained by standard procedures by immunisation of e.g. mice, 
extraction and immortalisation of the spleen cells and cloning of the hybridomas which 
produce antibodies specific for the polypeptide. 

In a further embodiment, the invention provides a process for the production of a phenyl- 
alanine and tyrosine-auxotrophic yeast strain, comprising the destruction of the endogenous 
chorismate mutase gene of the corresponding yeast strain, wherein the mutant displays less 
than 10% of the chorismate mutase activity of the chorismate mutase according to SEQ ID 
NO:2. 



According to the invention, homologous recombination is possible as a procedure for the 
production of the mutant. The disruption of the chorismate mutase gene by homologous 
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recombination is based on the replacement of the endogenous chromosomal copy of the gene 
by an inactivated copy. For the preparation of a disruption construct, the cloning of larger 
regions of the gene to be destroyed, is necessary for the efficient homologous recombination, 
preferably including the 5' and 3' regions. The cloned regions should preferably cover at 
least 2 kb. 

A Bgin digestion of genomic DNA from Hansermla polymorph* and subsequent Southern 
Blot analysis with a chorismate mutase-specific probe such as e.g. SEQ ID NO:3 gave two 
bands of 3.2 kb and 3.0 kb respectively. Further investigation showed that the 3.2 kg Bglll/ 
BgUI fragment includes 690 bp of SEQ ID NO:3 and contains the flanking 5' region. The 
3.0 bp Bgin fragment includes 960 bp of SEQ ID NO:3 and contains the flanking 3' region. 
The isolation and directed fusion of the 3.2 kb fragment with the 3.0 kb fragment by means of 
standard procedures known to the skilled person leads to a 6.2 kb fragment, which includes 
the chorismate mutase gene and large 5' and 3' flanking regions. 

According to the invention, the nucleic acid to be destroyed, preferably a nucleic acid 
according to SEQ ID NO.l or a nucleic acid homologous thereto, especially preferably the 
nucleic acid according to SEQ ID NO:3 or a nucleic acid homologous thereto, in particular 
the nucleic acid including the 6.2 kb genomic DNA fragment, a nucleic acid homologous 
thereto or parts thereof, is cloned and mutated by oligonucleotide exchange, wherein the 
mutation can include additions, deletions, inversions or substitutions, which decrease the 
expression of the gene or lead to an inactive translation product. The mutation preferably 
takes place in at least one region or parts thereof, which are selected from the nucleic acid 
positions corresponding to the amino acid positions (SEQ ID NO:2) 10 to 20, 154 to 167, 192 
to 196 and 240 to 247; these regions supposedly form the catalytic centre of the chorismate 
mutase. It is preferable to inactivate the nucleic acid coding for the chorismate mutase by 
cloning in a longer oligonucleotide. 

Preferably the endogenous chorismate mutase gene is present as a single copy gene. 

Preferably the oligonucleotide used for the oligonucleotide exchange comprises at least one 
selectable marker such as an antibiotic resistance or a metabolic marker. According to the 
invention, all selectable markers known in the state of the technology are included. 
Alternatively, the construct can also be prepared synthetically. For efficient homologous 
recombination, the construct should have a length of at least 2 kb. 
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The cloned-in oligonucleotide is flanked both 5' and 3' by chorismate mutase-specific 
fragments, whose sequences correspond to the sequences stated in claim 1 (a) to 1 (h) or are 
complementary to these and can hybridise with the original chromosomal copy. 

The construct is first linearised for the production of a disruption mutant. Next, the proto- 
trophic yeast strain intended for the production of the disruption mutant is transformed with 
the construct. Since the construct comprises a selectable marker, the transformants are 
selected by appropriate selection pressure. The phenylalanine/tyrosine-auxotrophic 
transformants are identified by growth on phenylalanine/tyrosine-free medium. 

In a preferred embodiment, the process comprises the steps: 

a) Preparation of a construct comprising at least two fragments of a nucleic acid according to 
claim 1 (a) to 1 (h) suitable for homologous recombination, which flank a nucleic acid 
unsuitable for homologous recombination; 

b) Transformation of cells of a yeast strain with an intact endogenous chorismate mutase 
gene with this construct 

and 

c) Identification of the phenylalanine and tyrosine-auxotrophic transformants. 

Especially preferably, the construct also comprises a selection marker gene. Further the 
construct can comprise one or several recombination sites, which preferably 5' and 3' flank 
the selectable marker and make possible the excision of the selectable marker from the 
construct after disruption of the endogenous chorismate mutase gene has been effected. 

Preferably the recombination site is loxP. In an especially preferred embodiment, the process 
further includes in step b) that the cells of the yeast cells in step b) are brought into contact 
with nucleic acid suitable for the expression of Cre-recombinase, as a result of which the 
excision of the selectable marker by means of the loxP/Cre-recombinase system is made 
possible. 
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In an especially preferred embodiment, the yeast strain is Hansenula polymorphs 

In a further embodiment, the invention provides a phenylalanine and tyrosine-auxotrophic 
yeast strain, such as is obtainable e.g. by the process according to the invention. 

In a further embodiment, a process for the recombinant production of proteins is provided, 
which includes: 

a) Transformation of an auxotrophic yeast strain according to the invention with a 
combination of a nucleic acid according to the invention suitable for expression and a 
heterologous gene suitable for expression under the control of a suitable promoter; 

b) Culturing of the transformants under conditions suitable for the expression of the 
heterologous gene and of the nucleic acid molecule according to the invention and if 
necessary isolation of the protein which is encoded by the heterologous gene. 

In a preferred embodiment, the process further includes the step of the selection of the 
transformants on a phenylalanine and/or tyrosine deficient medium. 

The recombinant production process according to the invention can be performed with 
nucleic acid molecule according to the invention and heterologous gene spatially separate 
from one another, where the two can be provided in two vectors (binary vector system) and 
can be transformed independently of one another. Alternatively, the nucleic acid molecule 
according to the invention, which codes for a polypeptide with chorismate mutase act.vty, 
and the heterologous gene are present in one vector. These cointegrate vectors have the 
advantage that the selectable marker gene and the heterologous gene are covalently bound to 
one another, so that all prototrophic clones after transformation also contain the heterologous 
gene, as a result of which the effort of screening is reduced. 

Promoters and vectors preferred according to the invention for the expression of the 
heterologous gene correspond to the promoters and vectors stated above for the expression of 
the nucleic acid molecule according to the invention. The vector optionally also contams a 
signal peptide-coding nucleic acid sequence, which causes the secretion of the recombinant 
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protein into the medium, and which is directly 5' bound to the heterologous gene. Also 
advantageous is the addition of a His-tag coding nucleic acid sequence to the 5' end of the 
heterologous gene, which after recombinant expression makes purification on a nickel chelate 
column possible. 

The invention further provides recombinant proteins which are obtainable by the process 
stated above. 

It is intended to illustrate the invention with the following figures and examples, but in no 
way to limit it. On the basis of the description and the examples, other embodiments are 
accessible to the skilled person, which are also included. 

m Figure 1 shows the biosynthesis of the aromatic amino acids via the shikimate pathway. 

y3 

I Figure 2 shows the ligation of the DNA fragments into the vector P RS426. The vector was 

O linearised with the restriction enzyme BamHI and the genomic Sau3 A DNA fragments from 

| H. polymorph were ligated into this cleavage site. This is possible, since the cleavage srtes 

O BamHI and Sau3A are compatible. 

! Figure 3 shows the growth comparison of cells transformed with 2 complementing DNA 

fragments To highlight the different growth rates, cell material of the cells complemented 
with a 5 kb (pME1524) or a 1.8 kb ( P ME1525 subclone of pME1524) DNA fragment was 
diluted in water and 20 ul of each dilution stage was applied onto minimal medium. The 
optical density of the different dilution stages is the same in each case, however since only 
the fundamentally different growth rates had to be illustrated, no accurate growth parameters 
were determined. 

Figure 4 shows the autoradiograph of a hybridisation of chromosomal DNA from different 
fungi with a 1.8 kb DNA fragment from H. polymorphs which is capable of complementing 
the phenylalanine/tyrosine auxotrophy of the 5. cerevisiae aro7A selection strain. Chromo- 
somal DNA from S. cerevisiae (1), A. mdulans (2) and H. polymorpha (3) was cleaved w,th 
the restriction enzyme EcoRV and hybridised with a *P radioactively labelled DNA probe, 
prepared using the genomic 1 .8 kb DNA fragment from H. polymorpha. 
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Figure 5 shows the sequence of a genomic 1.8 kb fragment (SEQ ID NO:3) which is capable 
of complementing the phenylalanine/tyrosine auxotrophy of the S. cerevisiae aro7A selection 
strain. The 5' and 3' region are shown in small letters, the open reading frame (SEQ ED 
NO.l) and the primary sequence (SEQ ID NO:2) of the derived gene product in large letters. 

Figure 6 shows the amplification of the flanking regions of the HAR07-ORF. The flanking 
regions of the HAR07 gene were amplified with the primers OLSK34/T7 and OLSK35/T3 
respectively and then cleaved with Apal/Bgin and BglDTXbal respectively. 

This is possible, since the primers OLSK34 and OLSK35 contain BglH recognition 
sequences. 



f Figure 7 shows the production of various deletion constructs. The flanking regions of the 

N* HAR07-ORF were ligated into the Apal/Xbal cleaved vector pBluescriptKSn as Apal/BglH 

J and BglDTXbal fragments respectively. The vector pBluescript H KS is marketed by the firm 

Stratagene, USA. The vector was again opened with Bgin and the disruption cassettes 
cleaved with Bamffl and BglH respectively were ligated in. 
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Material and Methods 

1. Material 
1.1 Chemicals 

Chemicals for the preparation of solutions, buffers and media were obtained from the firms 
Merck (Darmstadt, Germany), Boehringer Ingelheim Byproducts (Heidelberg, Germany), 
Carl Roth GmbH & Co KG (Karlsruhe, Germany), Gibco BRL (Life Technologies GmbH, 
Karlsruhe, Germany), Fluka (Neu-Ulm, Germany) and Sigma-Aldrich Chemie GmbH 
(Steinheim, Germany). 

Restriction enzymes, DNA-modifying enzymes and polymerases were purchased from MBI 
Fermentas (Vilnius, Lithuania) and RNase A from Boehringer Mannheim GmbH (Mannheim, 
Germany). As the DNA size standard the "1 kb DNA-ladder" (Gene Ruler Plus) from MBI 
Fermentas was used. Agarose was obtained from Roth. 

For the preparation of plasmid DNA from Escherichia coli and for the extraction of DNA 
from agarose gels, Kits from the firm Qiagen (Hilden, Germany) were used. 
Synthetic oligonucleotides were purchased from Nucleic Acid Products Supply Gottingen 
GmbH (Gottingen, Germany) and Gibco BRL (Life Technologies GmbH, Karlsruhe, 
Germany). 

Contract sequencing was performed by the firm MWG-Biotech GmbH (Ebersberg, 
Germany). 

1.2 Strains, Plasmids and Oligonucleotides 

For cloning, the E coli strain DH5a [F>80dlacZAM15,A(lacZY A-argF),U169,deoR,recAl, 
endAl,hsdR17,(r K -,m K + ),supE44A-,thi-l,gyrA96,relAl] was used (Woodcock, 1989). For 
this work, the K polymorpha strain RBI l(ura3) was used (Weydemann et al., 1995). 

The 5. cerevisiae strains used are set out in table 1. 

The plasmids and oligonucleotides needed for this work are listed in tables 2 and 3. 
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Strain 


Genotype 


Reference / 


^H2185 


MA TaAaro7::LEU2 suc2S9 ura3-52 leu2- 
3 Ieu2-U2 his4-519 


Schnappauf et al/1997 


RH1405 


MATasuc2-69 ura3-52 leu2-3 Ieu2-U2 
his4-519 


Schnappaufet al., 1997 





1 Plasmid 


Description 


/ Reference 


n 

•ess? 


pRS426 


5726 bp shuttle vector, bla URA3 ^ 


Sikorski and Hieter, 
1989 


Ul 


pBluescript KS II 


2961 bp vector, bla, lacZ, Multiple Cloning 
Site (MCS) / 


Stratagene 


CP 
Ms 

Hi s 


pME1513 


V426MET25 with altered MCS/focI 
/?A4Er25 Xbal Spel flo/nffl ^1 Notl 
Xhof CYCI Kpnl) / 


Probst, 1998 


i ft 

■KT i; 


pME1524 


5 kb Sau3A Fragment from H. p. in BamHI 
cleaved pRS426 / 






pME1525 


1.8 kb [ApaVHindm\-^agmQxA from pME 
1524inpRS426/SmaI) 






pME1526 


pBluescnpt II KS (ApaVHindHT) + 1.8 kb 
(Apal/Hinam)-Sragment from pME1524 






I able 2: Plasmids used ^ ■ - 1 




Oligonucleotide 


Size 
/ 


Sequence 




T3 ~r 


20-m'er 


5'-(AATTAACCCTCACTAAAGGG)-3 ' 




T7 


22/mer 


5'-(GTAATACGACTCACTATAGGGC)-3' 




OLSK34 


26-mer 


5 '-(ATATAGATCTAC AAA_AACT AAACAGG)-3 ' 




OLSK35 t 


'28-mer 


5 '-(AT AT AGATCTGATGCG- ACGC AGAAAAGC)-3 ' 



Table 3: Oligonucleotides used 
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2. Methods 

2.1 Culturing of the Microorganisms 
2.1.1 Escherichia coli 

The cells were cultured in Luria-Bertani medium (LB: 1% tryptone, 0.5% yeast extract, 1% 
NaCl) at 37°C. For strains with the ampicillin resistance marker, 50 mg/1 ampicillm were 
added to the medium. 

2.1.2 Hansenulapofymorpha 

The cells were cultured at 37°C either in "yeast extract peptone dextrose medium" (YEPD: 
2% peptone, 1% yeast extract, 2% glucose) or in "yeast-nitrogen-base medium" (YNB: 
0 150/0 yeast nitrogen base, 0. 5 o/o (NH^SO*, 0.1% myo-inositol (200 mM), 5% glucose, 
supplemented with uracil for the present strain). All media were autoclaved before use and 
for solid media 2% agar was added. 

2.1.3 Saccharomyces cerevisiae 

The cells were cultured at 30°C either in "yeast extract peptone dextrose medium" (YEPD: 
2% peptone, 1% yeast extract, 2% glucose) or in "minimal vitamins medium" (MV: 0.15% 
yeast nitrogen base, 0.52% ammonium sulphate, 2% glucose, 1% succinate, 0.3% KOH). 
Supplements such as L-amino acids or uracil were added after Sherman et al. (1986) and, hke 
the antibiotics, sterile filtered or autoclaved and added to the sterile medium. For sohd 
media, 2% agar was added. The growth of the yeast cells was followed by measurement of 
the optical density at 600 nm. 

2.2 Isolation of Nucleic Acids 

2.2.1 Qiagen Plasmid DNA Preparation from Escherichia coli 

Firstly the cultured bacteria were centrifuged offand the sediment formed was resuspended 
in 0.3 ml of buffer PI (buffer description as per manufacturer). Then 0.3 ml of buffer P2 
were added and the mixture incubated for 5 mins at room temperature. In the next step, 
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0.3 ml of the cooled buffer P3 were added and again incubated for 5 mins, this time on ice. 
Next, the mixture was centrifuged for 10 mins (12,000 rpm) and the supernatant added to a 
Quiagen-tip 20 column previously equilibrated with buffer QBT. After the supernatant had 
passed through the column, this was washed four times with 1 ml portions of buffer QC. The 
bound DNA was eluted with 0.8 ml of buffer QF. The eluted DNA was precipitated with 0.7 
volumes of isopropanol and centrifuged off (30 min, 10,000 rpm). Finally the DNA was 
washed with 1 ml of 70% ethanol, dried and taken up in H 2 0. 



2.2.2 



Isolation of Plasmid DNA from Escherichia coli (Birnboim and Doly, 1979) 



O The K coU we re cultured overnight in 5 ml of LB-Amp at 37°C on a rotary shaker. 

1.5 ml of the culture were centrifuged off in an Eppendorf reaction vessel and the cells 
resuspended in 100 ul of solution I (50 mM glucose, 10 mM EDTA, 20 mM Tris-HCl, P H 
8.4, 4 mg/ml lysozyme). After 7 mins incubation at RT, 200 ul of solution H (0.2 mM 
NaOH, 1% w/v SDS) were added, the mixture gently shaken and incubated for 5 mins on ice. 
50 ul of solution HI (3 M Na acetate, pH 4.8) were added, the mixture again gently shaken 
and incubated a further 7 mins on ice. The cell debris was centrifuged off and the super- 
natant extracted with 450 ul of CH 2 Cl 2 -saturated phenol in TE buffer (10 mM Tris-HCl, 
1 mM EDTA, pH 8.0) and then with 450 ul of CH 2 C1 2 . The plasmid was precipitated by 
addition of 1 ml of cold EtOH and incubation at -20°C. After 30 mins centrifugation at 4°C, 
the precipitate was washed with 200 ul of 70% EtOH, dried invaoio and taken up in 50 ul 
of TE buffer (incl. 25 ug/ml RNase A). To dissolve the DNA this was heated for 5 mins 
at 65°C, and RNA degradation was effected using the heat-stable RNase A by incubation for 
20 mins at 37°C. The plasmid DNA solution was stored at -20°C. 

2.2.3 Isolation of Chromosomal DNA from Hansenula potymorpha 
(Hofman and Winston, 1987) 

10 ml of YEPD medium were inoculated and incubated for about 15 hrs at 37°C. The cells 
were centrifuged off, resuspended in 0.5 ml of distilled water and again centrifuged off. The 
supernatant was discarded and 0.2 ml of lysis buffer (2% Triton X-100, 1% SDS, 100 mM 
NaCl, 10 mM Tris, pH 8.0, 1 mM EDTA), 0.2 ml of phenol/MeCl 2 /TE and 0.3 g of glass 
spheres were added. To disintegrate the cells this was shaken for 3-4 mins and then the cell 
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debris was centrifuged off (5 min, 13,000 rpra). The aqueous phase was placed in a new 
container and after addition of 1 ml of ethanol the DNA was precipitated and again 
centrifuged off. The sediment was resuspended in 0.4 ml of TE (10 mM Tris-HCl, 1 mM 
EDTA pH 8.0) and 3 ul of RNase A (10 mg/ml) were added, whereupon the mixture was 
incubated for 5 mins at 37°C. Now 10 ul of 4M ammonium acetate and 1 ml of ethanol were 
added, the precipitated DNA was centrifuged off, the supernatant discarded, dried and taken 
up in 50 ul TE [sic]. 

2.3 Cloning Techniques 

2.3.1 Polymerase Chain Reaction ('PCR'; Saiki et aL, 1985) 



Polymerase chain reactions were performed with the heat-stable enzyme Taq-polymerase 
I (Fermentas) in GeneAmpTM reaction vessel (Eppendorf). Usually each time 5-50 nmol of 

| primer oligonucleotide and 10-100 ng of DNA as matrix were used in 20-50 ul of reaction 

? buffer in accordance with the manufacturer's instructions. As a rule, 30 cycles of the 

following temperature regime were performed: 30 sees denaturation at 94°C / 30 sees 
hybridisation at specific reaction temperature / 30 sees - 3 mins synthesis at 72°C for Taq- 
DNA polymerase. The PCR cycles were initiated by a 3 minute denaturation step and 
finished by a final synthesis step of 5 mins. 



2.3.2 DNA Restriction 

For analytical restriction reactions, ca. 0.5 ug of DNA were incubated with 1-2 units of 
restriction enzyme in a volume of 20 ul for 2-3 hrs at 37°C. For the restriction of preparative 
quantities of DNA correspondingly larger volumes and quantities of enzyme were used. 
Reaction buffers were used in accordance with the manufacturer's instructions. 

2.3.3 Agarose Gel Electrophoresis 

0.1 vol. of DNA dye (25% w/v Ficoll 400, 0.25% w/v Bromphenol blue, 0.25% w/v 
xylenecyanol, 200 mM EDTA pH 8.0) were added to the restriction mixture, and the DNA 
fragments were separated electrophoretically in a horizontal agarose gel in TAE buffer 
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(40 mM Tris acetate, 20 mM NaOAc, 2 mM EDTA, pH 8.3) in the presence of 0.5 ug/ml 
ethidium bromide. DNA bands were detected with a UV Transilluminator (254 mm). 

2.3.4 Isolation of DNA Fragments from Agarose 

To isolate DNA fragments from the agarose gel, the columns and buffers of the QIAquick 
Gel Extraction Kit Protocol were used in accordance with the manufacturer's use directions. 
For this, the DNA fragments were firstly cut out from the gel and weighed. 3 volumes of the 
buffer QX1 were added, and incubated for 10 mins at 50°C. As soon as the gel fragment had 
completely dissolved, 1 volume of isopropanol was added and the mixture was placed on the 
QIAquick column and centrifuged off. To wash the DNA bound onto the column, 0.75 ml of 
PE buffer were added and again centrifuged. Finally, the DNA was eluted with 50 ul of H 2 0. 

2.3.5 Ligation of DNA Fragments (Maniatis et al., 1989) 

Linear DNA fragments with sticky or smooth ends were ligated in a reaction mixture in a 
total volume of 20 ul with ligation buffer (20 mM Tris-HCl, 10 mM MgCl 2 , 10 mM DTT, 
0.6 mM ATP, pH 7.6) and 5 units of T4-DNA ligase overnight at 15°C or for 5 hrs at room 
temperature. The concentration of DNA was between 1 and 10 ug/ml, and the vector/insert 
DNA mole ratio between 1 :5 and 1 : 10. Following the ligation reaction, the DNA was used 
for transformation without further purification. 

2.4 Transformation Methods 

2.4.1 Transformation ofE. coli (Inoue et al., 1990) 

Cells of a 250 ml SOB culture (2% tryptone, 0.5% yeast extract, 10 mM NaCl, 2.5 mM KC1, 
10 mM MgCl 2 , 10 mM MgS0 4 ) in the exponential phase were centrifuged off for 10 mins at 
4°C (lOOOxg) and resuspended in 80 ml of TB buffer (10 mM PIPES, 15 mM CaCl 2 xH 2 0, 
250 mM KC1, pH 6.7, 55 mM MnCl 2 xH 2 0). After incubation (10 mins on ice), the cells were 
again centrifuged off at 4°C and 2,500 rpm. Then the sediment was carefully resuspended in 
20 ml of TB buffer and also carefully mixed thoroughly with 7% DMSO. After 10 min 
incubation on ice, the now competent cells were aliquotted, shock-frozen in liquid N 2 and 
stored at -70°C. 200 ul of competent cells were added to 1-10 ug of DNA and incubated on 
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ice for 30 mins. After 30 sees heat shoek at 42°C, the mixture was ehilled on ice, treated 
with 800 nl of SOC medium (SOB, incl. 20 mM glucose) and shaken for 1 hr at 37°C. 
Various amounts (10-900 ul) of the cell culture were plated out on solid selective medium 
(agar plates with LB complete medium, supplemented with 50 ug/ml ampicillin) and finally 
the plates incubated overnight at 37°C to give transformed cells time for colony formats. 

2.4.2 S. cerevisiae Transformation (modified after Ito et al., 1983) 

0 5 ml of a fresh yeast culture, grown overnight, were used per transformation mix. The cells 
were centrifuged off at room temperature (5 mins, 3,500 rpm) and the supernatant was 
removed. Now the transforming DNA and 50 ug of carrier DNA were added and nnxed. 
J Finally 0 5 ml of PEG (40% PEG3350, 0.1M LiOAc, 10 mM Tris pH 7.5, 1 mM EDTA, 

S 0 1 M DTT) were added, and the transformation mixes were shaken and incubated for at least 

S 12 hrs at room temperature. After a 20 min heat shock (42°C), the cells were carefully 

f centrifuged, and incubated in 1 ml YEPD for 1 hr at 30°C. Finally the cells were centrifuged 

| for 5 sees in the bench centrifuge, resuspended in the remaining liquid after discarding of the 

fu supernatant and plated out on selection media, 

yhj 

ffr. 

2.5 Hybridisation Techniques 
2.5.1 Southern Hybridisation (Southern, 1975) 

For the Southern hybridisation, ca. 10 ug of chromosomal DNA were dissolved in TE buffer 
(10 mM Tris-HCl, 1 mM EDTA, pH 8.5) and subjected to restriction digestion for 12 hrs 
with a suitable restriction enzyme. The cleaved DNA was separated in a 1% agarose gel. 
The agarose gel was washed 3 times, for 20 mins each time, firstly in 0.25M HC1, then m 5M 
NaOH/1 0M NaCl and finally in 1M N^OAc. Now the separated DNA was transferred by 
12-hour dry-blotting onto a nitrocellulose membrane. This was next washed for 2 nuns m 
2 x SSC (0. 1M NaCl, 10 mM NaOAc, pH 7.0), dried in air, and finally "erosslinkmg" was 
performed for 5 mins under UV light (254 nm). 

The membrane was prehybridised for 30 mins at 65°C with 50 ml of Church buffer (7% SDS, 
l«/o BSA, 1 mM EDTA, 250 mM Na phosphate, pH 7.2), after which half of the hybridisation 
solution was discarded. Now the radioactively-labelled DNA probe was added and the 
hybridisation was performed overnight at 65°C. Following the hybridisation, the membrane 
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. «or with 2 x SSC/0.1% SDS and then exposed on an Xray film 
was washed for 30 nuns at 65 C with 2 x as 

or on a phosphor imaging plate. 

2.5.2 Preparation of the Probe DNA 

required was previously amplified oy rv, 

• *• n of the Chorismate Mutase from Hansenula polymorpka 
2.6 Characterisation of the cnonsma 

2.6.1 Crude Extract Preparation 

, A 'n YNB medium, individually supplemented for the particular 
The yeast cells were cultured in of ca^4 Crude extracts were prepared as described 

yeast strains, and harvested at an OD 54 6 of ca. 
byKradolfer etal. (1977), using an Amicon French Press. 

2.6.2 Determination of Protein Content 

^^^^^^^^^ 
(\976) with BS A as protein standard. 

2.6.3 Measurement of Enzyme Activity 

e„, of the chorismate mouse activity, stoptests, such as those developed* 
For the measuremeu of herons ^ ^ ^ tegs ^ 

Schmidhe,me.al.(19S9).w«hafew ^ 
^onmeasurementswerepe^rm^ 30C M(<kt 

protein fraction concerned. The reaction, 1Q mins 

Ledhyadditionofchorismateataftnafconcenfrattono ^ 

„«„ 500 p. oHM HO, which also effects the converston of prephenate 
which is measurable at OD320 



.:■ . , - .1^x1 v. " 
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HeotraBsafion by addition of4mloflM NaOH ended ,he reaction. The measuremem - 
«*. was performed against H,0, and sin* so.u,ions without enzyme were used as null 
vata e, Themea.remen.swe.e.soperfonnedin.hep.esenceofSMMMryp.ophan. The 

mn value, of .he measurements without e^yme were subbed from the 
Lure ^.erex.inCioncoefflcien.ofpheny.pynrvarearar.C of 13095 ^(moUcm), 
the conversion rare (umo, producer * mg prorein)) and hence ,he specific acttvtty 
(conversion/mg protein) could be calculated. 

Results 

,. Preparation of a Genomic Gene Bank from Ha^mlapolymorpHa 
For the preparafion of a genomic gene bant chromosome, DN A fiom 

^ The recognition sequence of this enzyme is relatively frequently present . the 
genome, since it consists of omy 4 bases, ^nng the 5-hour ^ to ""~ way 
Len a. 15 min intervals, and the reaction in each stopped by addttton of EDTA. In fin. w , 

cleaved DNA was separated in an agarose gel, divided into different size fractions benveen 
" k b andLeapondingiyextracedfromthegel. The e*^ fragment :*~m 

BamHI (figure 2). 

THe pUsmid hbrary thus obtained was transformed into E. co„, duting which tfre an«otic 
reslce encoded by the bla-gene was seiected fo, in the presence of amp, . m . «M* 
about 15 o,000transfornun,s were obtained. These were washed from the plates with LB 
medium and stored at -MX after addition of glycerine. 

2. A Genomic 1.8 kb DNA Fragment from H. po^orpka Compacts the «r„7^ 
Phenotype in S. cerevisiae 

int o he S — strain RH2.S5 (MATa(Aaro7.:LEU2 «M9 ura3-52 ,eu2-3 - B 
ZsV in order ,o comment its Tyr/Phe auzotrophy, which is attribute to a deletton 
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in the AR07 gene. The transformed yeast cells were firstly selected for uracil prototrophy 
and thus for the presence of the URA3 gene of the vector pRS426 (SC-Ura) and then 
transferred to minimal medium (YNB + Trp + His). After about 5 incubation stages at 30°C, 
3 transformed yeast colonies were isolated which were capable of growing in the absence of 
uracil, tyrosine and phenylalanine. The plasmids were isolated and even after a retransform- 
ation were capable of complementing the AR07 deletion in RH2185. By restriction analysis, 
an approximately 5 kb additional DNA fragment was identified in the vector pRS426. For 
further localisation of the DNA fragment, a subcloning was performed wherein the fragments 
obtained were again ligated into the vector pRS426. These different plasmids were now, 
analogously to the plasmids of the genomic bank, examined for their ability to complement 
the Tyr/Phe auxotrophy inRH2185. An approximately 1.8 kb Apal/Xbal fragment was thus 
obtained, which was capable of doing this. This finding was confirmed by repeated retrans- 
formation. It was striking that the 1 .8 kb H. polymorpha DNA fragment (in the plasmid 
pME1525) in transformed yeast cells showed growth comparable with the wild type, while 
the 5 kb H. polymorpha fragment (in the plasmid pME1524) in transformed yeast cells 
resulted in markedly slower growth (figure 3). 

In order to confirm that the complementing 1 .8 kb fragment originated from Hcmsenula 
polymorpha, this was used as the probe in a Southern hybridisation. The chromosomal DNA 
from H. polymorpha, A. nidulans and S. cerevisiae was cleaved with the restriction enzyme 
EcoRV and after hybridisation with the probe a clear signal could be seen with H. poly- 
morpha, a weak signal with S. cerevisiae and no signal with A. nidulans. Furthermore, it 
could be shown by the Southern hybridisation that the gene occurs only once in the genome 
ofH. polymorpha (figure 4). 

The 1.8 kb DNA fragment from H. polymorpha RBI 1 was sequenced completely and an 
open reading frame consisting of 843 bp could be identified (figure 5), which showed a 58% 
agreement with the AR07 gene from S. cerevisiae. It can thus be assumed that this is the 
gene of the yeast Hansenula polymorpha homologous to AR07, which is referred to below as 
HAR07. 
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3. Preparation of Various Deletion Constructs 

On the basis of the sequence of the 1 .8 kb DNA fragment from H. polymorpha, the primers 
OLSK34 and OLSK35 were used for the preparation of various haro7A-deletion constructs. 
The various deletion constructs were transformed into H. polymorpha, in order to confirm the 
identity of the gene and to confirm that that no isoenzymes of chorismate mutase exist in//. 
polymorpha. 

By deletion of HAR07 in Hcmsermla polymorpha, the identity of this gene can be conclus- 
ely proved and the existence of alternative genes checked. At the same time, the construct- 
of such a deletion mutant opens up new selection possibilities. For the preparation of a 
haro7A deletion strain, three different disruption cassettes were constructed. For the 
necessary integration into the Hansenula polymorpha genome, the flanking regions of the 
HAR07 gene, which should effect the deletion of the HAR07 gene by homologous 
recombination, were used for all three constructs. The flanking region in the 3' direction was 
amplified using the primers OLSK34 and T7, and next cleaved with the restriction enzymes 
Apal and BglH. The flanking region in the 5' direction was amplified using the primers 
OLSK35 and T3, and cleaved with the restriction enzymes Xbal and BglH. This is possible 
as the primers OLSK34 and OLSK35 contain a BglH cleavage site (figure 6). 

Both fragments were ligated into the Apal/Xbal cleaved vector pBluescript KSH and then the 
vector was again opened with BglH, in order to introduce various disruption cassettes 
(figure 7). 

The hisG::URA3::hisG disruption cassette (Schneider et al., 1996) was cleaved with Bglll 
and ligated into the vector described above. By means of the URA3 marker, integration of 
the cassette into the genome of//, polymorpha can be checked. Whether a homologous 
integration, i.e. a deletion of HAR07, has in fact taken place can be shown by selection for 
phenylalanine/tyrosine auxotrophy and by Southern hybridisation. The hisG components 
favour the conclusive excision of the URA3 marker, which can be checked by counter- 
selection with 5-FOA (5-fluoroorotic acid), since the URA3 gene product causes a toxic 
conversion of 5-FOA so that only cells without the URA3 marker are capable of surviving 
(Boeke et al., 1984). 
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TheloxP.kanMX.io Bvm ea„ s ofthekanMXmaAer,.ntegnitionof 
^^anin^ne au.co.ropny a*d by S^*W~ 

the flanking IoxP regions. 

cassette <vju Miller oersonal communication). In tms 

e ene (=URA3) from tfanWa polymorpha (M Hiller, persona 

(Boeke et al., 1984). 

f * hur T -URA3hisG disruption cassette into H. polymorpha RBI 1 
and thus possessed the phenotype to be expected. 

F „„c,Uma. Cn.plemeaU.io" far Pmsmid Selection: 
pUsmid pFPMT121 from .he firm Rhein B,o.ech GmbH serves 
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bears the URA3 gene from S. cerevisiae for selection in H. po/^ after transformation 
bears the URA3 ge (Weydemann et al., 1995)(odcl orura3). 

of uracil-auxotrophic strains such as e.g. Ktsi i ^.wey 

The JU> promoter (BMW 299 108) from E «— * «» M ° X 

ffW—-* ,994), serves asthe expression modu,e. 

A*, * — >- enzymeNde,, a 5 xb DN A fragment can be isolated from 

Th i scome S ponds«othep.as mi d»i.hou.«heURA3-cod i n 8 se,ue„ce. Afrer 

PFPMT121 Tins c P tteatment tMs Mn ^ , |gated Wlth . 

fillineofthe overhangs of this UNA iragmciu y ... hears 

tilling or x The resulting plasmid now bears 

i i vh Fm72\ISspI fragment from pbKov tau^ u-> ^ i 

a. ^ m ,,t a QP The selection s performed for prototropny 
which codes for the enzyme chonsmate mutase. The 
with respect to tyrosine and phenylalanine. 

Alternatively the 1 .3 kb Eco72VSspl fragment can be cloned into the plasmid pFPMT 121 
marker genes, WM3 and ft4/?07. 

A recombmam DNA fragment for the expression of a ^ 

As an Ltple of such a conrro, seance, .he MFal gene from S ~ (Arnold * 
1998) may be mentioned. 

Wormed .0 .he heterologous expression plasmids and positive — » selected by 

inwhicnme i» exoreS sion of the heterologous protein 

pref erab. y used as ,be express** ^ ^ „ y cbMlging 

p^b.y .axes place afrer cu uru, -* ™ * ^ ^ 

,„ m edium wi,h glycenne as .he only C source. Unde ^ 
of the FMD promoter, linked with strong express** of the heterologous p 
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can be isolated in pure form from the purified culture supernatant by standard chromato- 
graphic methods. 

5. The 1.8 kb Fragment from Hansenula polymorpha Codes for a Protein with 
Chorismate Mutase Activity 

To check whether the 1.8 kb fragment contains a gene which codes for a protein with 
chorismate mutase activity, cell extracts of different plasmid-bearing yeast strains were 
prepared using a French Press and their chorismate mutase activities were measured. For the 
measurement of the specific enzyme activity, cell extracts from S. cerevisiae RH2185 
(PRS426), H. polymorphous, S. cerevisiae RH2185(pME1524) and S. cerevisiae 
RH2185(pME1525) were prepared. It could be shown that both the 5 kb fragment 

(PME1524) and also the 1.8 kb fragment (pME1525) from*, polymorpha code for genes for 

a chorismate mutase activity (table 4). 



' ;(-:ti^^han); f :,|tife 


r>; '. : , Specific^ 
V- ' (+ trypt 


ophan) m tysr r< 


S. cerevisiae- 
RH2185 + pRS426 


' 0 


5. cerevisiae- 
RH2185 + pRS426 


0 


H. polymorpha- 
RB11 


0.379 


K polymorpha- 
RB11 


0.349 


5. cerevisiae- 
RH2185 + pME1524 


" (U91 


S. cerevisiae- 
RH2185 + pME1524 


0.127 


S. cerevisiae- 
RH2185 + pME1525 


0.469 

1 


S. cerevisiae- 
RH2185 + pME1525 


0.263 

i 



T»b.4: Specific chorismate mutase activities (U/mg protein) oT WxWctt. 
The tabiejhow, the mean of JJ-Jgr * | J^^T* 
2? ^~uTT»rSe^; J» in each case peeked with ana 
without 500 joM tryptophan. 
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6. Identification of the HAR07 Gene of the Yeast Hansertula polymorphic 

The complementing 1.8 kb DNA fragment was completely sequenced using a T3 and a T7 
primer (figure 5). 

The enzyme chorismate mutase is encoded in the yeast Hansenula polymorpha by a gene 
which consists of 843 bp and has been named HAR07. This gene was cloned by 
complementing the phenylalanine/tyrosine auxotrophy of a Saccharomyces cerevisiae aro7A 
deletion strain with a genomic gene bank from Hansenula polymorpha. Hence the H. 
polymorpha promoter is functional in S. cerevisiae. No introns are present, and a 32 kDA 
protein consisting of 280 amino acids can be derived from the sequence. The amino acid 
sequence of the chorismate mutase from Hansenula polymorpha shows 58% identity to that 
from Saccharomyces cerevisiae and 44% to that from Aspergillus nidulans. A striking 
feature in the comparison of these amino acid sequences is an endpiece consisting of 23 
amino acids in the sequence from H. polymorpha, which is present neither in S. cerevisiae 
nor in A. nidulans, nor in other eukaryotic chorismate mutase amino acid sequences. 
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